## Using rownames(importance_df) as id variables
Endpoint has data from Days 2, 3, and 10; this might be misleading since it contains CFU from different time points.
RF: all mice/days together. RF: CFU vs community by day.
## Using rownames(importance_df_day) as id variables
## Using rownames(importance_persist) as id variables
## Using rownames(importance_persist_day) as id variables
Eliminating mice euthanized early from the RF model gives similar R^2 and MSE, but there is a slight advantage for Day 0 community OTU features in predicting Day 1 CFU. In addition, the R^2 value increases with increasing features, whereas when all mice are used the Day 1 R^2 only decreases with increasing features. Of note, this is accompanied by an increase in the % MSE attributed to OTU15 (Akkermansia), which has seemed to stand out in all other days/analyses. Interestingly, Akkermansia does not appear to have the same relationship when CFU is compared to the community of the same day. This could suggest Akkermansia is promoting the initial colonization of C. difficile. At first glance, OTU135 (Coriobacteriaceae) seems to stand out for predicting CFU of the same day as well as from Day 0.